Reinforcement Learning with Success Induced Task Prioritization
نویسندگان
چکیده
Many challenging reinforcement learning (RL) problems require designing a distribution of tasks that can be applied to train effective policies. This specified by the curriculum. A curriculum is meant improve results and accelerate it. We introduce Success Induced Task Prioritization (SITP), framework for automatic learning, where task sequence created based on success rate each task. In this setting, an algorithmically environment instance with unique configuration. The algorithm selects order provide fastest agents. probability selecting any next stage determined evaluating its performance score in previous stages. Experiments were carried out Partially Observable Grid Environment Multiple Agents (POGEMA) Procgen benchmark. demonstrate SITP matches or surpasses other design methods. Our method implemented handful minor modifications standard RL provides useful prioritization minimal computational overhead.
منابع مشابه
Task-Oriented Reinforcement Learning
Acknowledgement This thesis is the result of two years of work whereby I have been accompanied and supported by many people. I am extremely indebted to Dr.
متن کاملTask-Oriented Query Reformulation with Reinforcement Learning
Search engines play an important role in our everyday lives by assisting us in finding the information we need. When we input a complex query, however, results are often far from satisfactory. In this work, we introduce a query reformulation system based on a neural network that rewrites a query to maximize the number of relevant documents returned. We train this neural network with reinforceme...
متن کاملZero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generaliz...
متن کاملBayesian Multi-Task Reinforcement Learning
We consider the problem of multi-task reinforcement learning where the learner is provided with a set of tasks, for which only a small number of samples can be generated for any given policy. As the number of samples may not be enough to learn an accurate evaluation of the policy, it would be necessary to identify classes of tasks with similar structure and to learn them jointly. We consider th...
متن کاملReinforcement Learning for Pivoting Task
In this work we propose an approach to learn a robust policy for solving the pivoting task. Recently, several model-free continuous control algorithms were shown to learn successful policies without prior knowledge of the dynamics of the task. However, obtaining successful policies required thousands to millions of training episodes, limiting the applicability of these approaches to real hardwa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19493-1_8